We consider a number of prior probability distributions of particular interest, all being defined 
on the three-dimensional convex set of two-level quantum systems. Each distribution is — following 
recent work of Petz and Sudar — taken to be proportional to the volume element of a monotone 
metric on that Riemannian manifold. We apply an entropy-based test (a variant of one recently 
developed by Clarke) to determine which of two priors is more noninformative in nature. This 
involves converting them to posterior probability distributions based on some set of hypothesized 
outcomes of measurements of the quantum system in question. It is, then, ascertained whether or 
not the relative entropy (Kullback-Leibler statistic) between a pair of priors increases or decreases 
when one of them is exchanged with its corresponding posterior. The findings lead us to assert that 
the maximal monotone metric yields the most noninformative prior distribution and the minimal 
monotone (that is, the Bures) metric, the least. Our conclusions both agree and disagree, in certain 
respects, with ones recently reached by Hall, who relied upon a less specific test criterion than our 
entropy-based one. 
At the outset of this letter, let us note that subsequent to an earlier version of it (quant-ph/9703012), Hall
communicated a study [1] having quite similar (though perhaps more clearly articulated) objectives. Nevertheless, in terms
of methodologies and conclusions, the two studies appear to differ significantly. We shall indicate these interesting
points of agreement and disagreement.

Let us reiterate the fundamental question motivating Hall, which he states at the beginning of his presentation: 
"what statistical ensemble corresponds to minimal prior knowledge about a quantum system? Such an ensemble may 
be identified as the most random ensemble of possible states of the system. It would provide, for example, a natural
benchmark for assessing how "random" a given evolution process is; a worst-case scenario for general schemes for
extracting information about the system; and a natural unbiased measure over the set of possible states of the system
(which would allow one to calculate, e. g., the average effectiveness of a general scheme for distinguishing between
quantum states)".

In our work presented below, we have relied upon a specific entropy-based test (related to one recently developed
by Clarke [2]) to address these issues Hall has raised. The conclusions of Hall themselves, on the other hand, rest on
the less specific (and apparently, from our point of view, not determinative) proposition that "maximal randomness
corresponds to an ensemble with maximal symmetry." In fact, we will argue that Hall's minimal-knowledge ensemble
(based on the Bures metric [3]) is not truly minimal. However, we do agree with Hall that this particular ensemble is
superior in information terms to that associated with a uniform distribution (as previously employed by Larson and
Dukes [4]) over the convex set (Bloch sphere [5]) of two-level quantum systems.

Brody and Meister [6] recently studied the problem of deciding between two a priori possible two-level quantum
mechanical pure states. They assert that "if prior knowledge is not available, one can still employ the Bayesian
approach, using a noninformative prior. However, the analysis of such cases is beyond the scope of the present
Letter." In this study, we do consider the problem of determining an appropriate noninformative prior, for the
more general situation in which advance information regarding the degree of purity of the unknown system is lacking.
("In a real situation one can never design a preparator such that it produces an ensemble of identical pure states.
What usually happens is that the ensemble consists of a set of pure states each of which is represented in the ensemble
with a certain probability" [7].)

As noted, we adapt recent work of Clarke [2] to provide us with an operational criterion for deciding which of two
priors is more noninformative in nature. We apply our test to a number of priors, all of which (with the exception
of the uniform distribution of Larson and Dukes [4]) are obtained by normalizing the volume elements of monotone
metrics [8-11]. We find that the maximal monotone metric (of the left logarithmic derivative, which does not exist
for pure states) yields the most noninformative prior of those examined, but only if we first rule out the possibility that
the unknown two-level system is in a pure or nearly pure state. (Jones [12] has considered the somewhat opposite
situation, in which the unknown system is definitely in a pure state.) We also find, in strong contrast to Hall
[1], that the minimal monotone metric (of the symmetric logarithmic derivative) yields the least noninformative
prior of those examined (other than the uniform one of Larson and Dukes [4]). It also appears that our (Bayesian)
notion of noninformativity is equivalent to the (classical) one of Petz and Toth [11, p. 215] in their work comparing
(Cramer-Rao-type) lower bounds for the variances/covariances of unbiased estimators of the parameters of quantum
systems.



In (classical/nonquantum) Bayesian theory [13], in the absence of any information regarding the specific values of
the parameters of a family of probability distributions, one uses a noninformative (Jeffreys) prior (cf. [?]). This is
taken to be proportional to the volume element of the unique monotone (Fisher information) metric on the Riemannian
manifold formed by the family. "An infinitesimal statistical distance has to be monotone under stochastic mappings"
[?, p. 786]. Petz and Sudar [8], building upon work of Morozova and Chentsov [18], have recently shown
that, in the quantum/noncommutative case, there is not a single monotone metric, but rather a nondenumerable
number of such metrics. Each corresponds to an operator monotone function $f(t)$ satisfying the normalization and
symmetrization conditions, $f(1) = 1$, $f(t) = t f(t^{-1})$. (A function $f : \mathbb{R}^+ \to \mathbb{R}$ is called operator monotone if the
relation $0 \le K \le H$, meaning that $H - K$ is semipositive definite, implies $0 \le f(K) \le f(H)$, for any matrices $K$ and
$H$ of arbitrary order [19].) "Therefore, more than one privileged metric shows up in quantum mechanics. The exact
clarification of this point requires and is worth further studies" [8, p. 2672]. This Letter seeks to contribute to such
a clarification (cf. [20]). We, thus, restrict our considerations to priors which are proportional to volume elements
of monotone metrics. Although this still leaves a nondenumerable number of candidate priors, our results indicate
that certain ones (in particular, that obtained from the maximal monotone metric) can be distinguished for their
information-theoretic properties.

II. ANALYSIS 

A. Two prior probability distributions 

We begin our analysis by examining two monotone metrics of particular interest [?]: the Kubo-Mori metric (given
by $f(t) = (t - 1)/\log t$) and the minimal monotone SLD (symmetric logarithmic derivative) "Bures-type" [21] metric
(given by $f(t) = (1 + t)/2$). In recent papers [22] (cf. [20,23]), the author has proposed and analyzed the use of a
prior probability distribution,

$$(1 - x^2 - y^2 - z^2)^{-1/2} / \pi^2. \qquad (1)$$

This is the normalized form of the volume element of the SLD-metric over the Bloch sphere [24] (the unit ball in
three-space, comprised of the points $x^2 + y^2 + z^2 \le 1$) of $2 \times 2$ density matrices,

$$\frac{1}{2} \begin{pmatrix} 1 + z & x - iy \\ x + iy & 1 - z \end{pmatrix}. \qquad (2)$$

In spherical coordinates ($x = r \cos\phi \sin\theta$, $y = r \sin\phi \sin\theta$, $z = r \cos\theta$), (1) takes the form,

$$p_{SLD}(r, \theta, \phi) = r^2 (1 - r^2)^{-1/2} \sin\theta / \pi^2. \qquad (3)$$

On the other hand, use of the Kubo-Mori metric yields a prior probability distribution,

$$p_{KM}(r, \theta, \phi) = r (1 - r^2)^{-1/2} \log[(1 + r)/(1 - r)] \sin\theta / (4\pi^2). \qquad (4)$$

The distributions (3) and (4) are obtained by substituting the corresponding operator monotone functions given above
into the formula [8, eq. 3.17],

$$r^2 (1 - r^2)^{-1/2} (1 + r)^{-1} \sin\theta / f[(1 - r)/(1 + r)], \qquad (5)$$

and, then, normalizing. Both $p_{KM}$ and $p_{SLD}$ are monotonically increasing with $r$, with $p_{KM}$ assigning greater
probability to systems that are nearly pure ($r > .957504$) and, compensatingly less, to relatively mixed systems
($r < .957504$).
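
As an illustrative check (not part of the original analysis), the following Python sketch substitutes the two operator monotone functions into formula (5) and integrates over the Bloch ball. The function names and the midpoint quadrature in the variable eps, with r = cos(eps), are our own assumptions; the computed volumes should be close to $\pi^2$ and $2\pi^2$, which are the constants that turn (5) into (3) and (4).

```python
import math

def f_sld(t):
    # Minimal monotone (SLD, "Bures-type") metric.
    return (1.0 + t) / 2.0

def f_km(t):
    # Kubo-Mori metric.
    return (t - 1.0) / math.log(t)

def radial_element(f, r):
    # Formula (5) with the angular factor sin(theta) left out.
    t = (1.0 - r) / (1.0 + r)
    return r * r / (math.sqrt(1.0 - r * r) * (1.0 + r) * f(t))

def total_volume(f, n=100_000):
    # Midpoint rule in eps, where r = cos(eps): the Jacobian sin(eps) cancels the
    # (1 - r^2)^(-1/2) singularity at r = 1. The angles contribute a factor 4*pi.
    h = (math.pi / 2.0) / n
    total = 0.0
    for k in range(n):
        eps = (k + 0.5) * h
        total += radial_element(f, math.cos(eps)) * math.sin(eps)
    return 4.0 * math.pi * total * h

vol_sld = total_volume(f_sld)   # compare with pi^2, the constant in (3)
vol_km = total_volume(f_km)     # compare with 2*pi^2, which turns (5) into (4)
print(vol_sld / math.pi**2, vol_km / (2.0 * math.pi**2))
```

Dividing `radial_element(f, r) * sin(theta)` by the corresponding volume gives the normalized prior at any point of the ball.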
We, first, compare the suitabilities of $p_{SLD}$ and $p_{KM}$ as possible noninformative or "reference" priors [13] for the
quantum inference or estimation of an unknown two-level system. (Appropriate informative priors should, of course,
be used if specific knowledge regarding the parameters of the system is available [?].) We modify (as explained below;
in any case, doing so apparently, at least in our context, leads to no substantive differences) a general line of
reasoning recently elaborated upon by Clarke [2]. We note, in this regard, that the relative entropy (Kullback-Leibler
number or information gain or directed divergence) [13] of $p_{SLD}$ with respect to $p_{KM}$, that is,

$$D(p_{SLD} \| p_{KM}) = \int_0^1 \int_0^{\pi} \int_0^{2\pi} p_{SLD} \log[p_{SLD}/p_{KM}] \, d\phi \, d\theta \, dr \qquad (6)$$

(the natural logarithm is understood throughout this communication) is .0891523 "nats" (1 nat equals approximately 1.4427
bits), while $D(p_{KM} \| p_{SLD})$ is .0975976 nats. It proves possible to reduce the former statistic by incorporating
certain information into our considerations, but not the latter. This leads us to the conclusion that $p_{KM}$ is more
noninformative than $p_{SLD}$.
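
The two statistics just quoted can be approximated with a short Python sketch. It is offered only as a plausibility check under our own assumptions: because both priors share the angular factor, (6) collapses to a one-dimensional radial integral, which we evaluate with a midpoint rule after the substitution r = cos(eps). The helper name `prior_relative_entropies` is hypothetical.

```python
import math

def prior_relative_entropies(n=100_000):
    # Both priors share the angular factor sin(theta)/(4*pi), so (6) reduces to a
    # radial integral. With r = cos(eps), the radial densities become
    #   p_SLD: (4/pi) r^2 d(eps),   p_KM: (1/pi) r L(r) d(eps),
    # where L(r) = log[(1 + r)/(1 - r)].
    h = (math.pi / 2.0) / n
    d_sld_km = d_km_sld = 0.0
    for k in range(n):
        eps = (k + 0.5) * h
        r = math.cos(eps)
        one_minus_r = 2.0 * math.sin(eps / 2.0) ** 2   # accurate form of 1 - cos(eps)
        L = math.log((1.0 + r) / one_minus_r)
        log_ratio = math.log(4.0 * r / L)              # log(p_SLD / p_KM)
        d_sld_km += (4.0 / math.pi) * r * r * log_ratio
        d_km_sld -= (1.0 / math.pi) * r * L * log_ratio
    return d_sld_km * h, d_km_sld * h

d_sld_km, d_km_sld = prior_relative_entropies()
print(d_sld_km, d_km_sld)   # to be compared with .0891523 and .0975976
```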

After some initial numerical experimentation, we were led to assume that six spin measurements had been performed
(two in the X-, two in the Y-, and two in the Z-direction) using six replicas of a spin-1/2 system and that
for each of these pairs we obtained one "up" and one "down". (It would be of interest to conduct a parallel series of
analyses to that reported below, based on what have been found to be optimal sets of measurements [14], [15].) Then,
applying Bayes' Theorem [13], we converted $p_{SLD}$ and $p_{KM}$ to posterior distributions by multiplying them by
the likelihood of such a set of six outcomes [?, 12],

$$(1 - x^2)(1 - y^2)(1 - z^2)/64 = [(1 - x)/2][(1 + x)/2][(1 - y)/2][(1 + y)/2][(1 - z)/2][(1 + z)/2], \qquad (7)$$

and normalizing the resulting products over the Bloch sphere. (The normalization factors are $64 \times 192/71$ in the
SLD-case and $64 \times 19600/6047$ in the KM-case.) The relative entropy of $p_{SLD}$ with respect to the KM-posterior is, then,
reduced, as a result of the added information, from .0891523 to .0720681. On the other hand, the relative entropy of
$p_{KM}$ with respect to the SLD-posterior is increased dramatically from .0975976 to .457259. Paraphrasing Clarke [2, p.
173], "[$p_{SLD}$] is already more informative than [$p_{KM}$], so we cannot make it less informative by adding information".
However, if we were to replace the likelihood (7) by its square (that is, in effect, assume twelve measurements,
giving two "ups" and two "downs" in each of the three mutually orthogonal directions) then the relative entropy of
$p_{SLD}$ with respect to the corresponding revised or updated KM-posterior would not further decrease from .0720681,
but would increase to .334699. Thus, the informativity of $p_{SLD}$ with respect to $p_{KM}$ is limited, in this manner. (In
an approximate sense, then, the information contained in $p_{SLD}$ can be described as that in $p_{KM}$ with the addition of
that gained by knowledge of the outcomes of the six measurements.)
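
The normalization factors and the two prior-versus-posterior statistics for the six outcomes can be cross-checked with the sketch below. It rests on assumptions of ours that the text does not state: the standard sphere averages (x^2 -> r^2/3, x^2 y^2 -> r^4/15, x^2 y^2 z^2 -> r^6/105), the closed-form angular average k(r) of log(1 - z^2), and the identity D(p || Q) = D(p || q) + log E_q[l] - E_p[log l] for a posterior Q proportional to q times l. All function names are hypothetical.

```python
import math

def radial_statistics(n=100_000):
    # Accumulate, for p_SLD and p_KM, the radial moments E[r^2], E[r^4], E[r^6],
    # E[k(r)] and the prior relative entropy (6), using r = cos(eps).
    h = (math.pi / 2.0) / n
    names = ("m2", "m4", "m6", "k", "kl")
    sld = dict.fromkeys(names, 0.0)
    km = dict.fromkeys(names, 0.0)
    for i in range(n):
        eps = (i + 0.5) * h
        r = math.cos(eps)
        one_minus_r = 2.0 * math.sin(eps / 2.0) ** 2
        L = math.log((1.0 + r) / one_minus_r)
        # k(r): average of log(1 - z^2) over the sphere of radius r.
        k = math.log((1.0 + r) * one_minus_r) - 2.0 + L / r
        log_ratio = math.log(4.0 * r / L)          # log(p_SLD / p_KM)
        for acc, w, kl in ((sld, 4.0 * r * r, log_ratio), (km, r * L, -log_ratio)):
            w *= h / math.pi                       # radial weight of the prior
            acc["m2"] += w * r**2
            acc["m4"] += w * r**4
            acc["m6"] += w * r**6
            acc["k"] += w * k
            acc["kl"] += w * kl
    return sld, km

def mean_product(stats):
    # E[(1 - x^2)(1 - y^2)(1 - z^2)] from the radial moments and sphere averages.
    return 1.0 - stats["m2"] + stats["m4"] / 5.0 - stats["m6"] / 105.0

def prior_vs_posterior(p, q):
    # D(p || Q), with Q the posterior of q after the six outcomes in (7).
    return p["kl"] + math.log(mean_product(q)) - 3.0 * p["k"]

sld, km = radial_statistics()
factor_sld = 1.0 / mean_product(sld)      # compare with 192/71
factor_km = 1.0 / mean_product(km)        # compare with 19600/6047
d_sld_postkm = prior_vs_posterior(sld, km)   # compare with .0720681
d_km_postsld = prior_vs_posterior(km, sld)   # compare with .457259
print(factor_sld, factor_km, d_sld_postkm, d_km_postsld)
```

The factor 64 in the quoted normalization constants is the denominator of (7) and cancels in the relative entropies.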

If we had conformed strictly to the line of argument of Clarke [2], we would have exchanged the positions of the
priors and posteriors in the relative entropy statistics reported above. Nevertheless, it seems rather evident that, in
the context of the present study, we would reach the same fundamental conclusions if we had done so. For example
(again, based on the same six measurement outcomes), the relative entropy of the KM-posterior with respect to
$p_{SLD}$ is .0603743 (cf. .0720681) and that of the SLD-posterior with respect to $p_{KM}$ is .399442 (cf. .457259). (Even
though relative entropy "is not a true distance between distributions since it is not symmetric and does not satisfy
the triangle inequality... it is often useful to think of relative entropy as a 'distance' between distributions" [?, p. 18].)
Our initial rationale for not simply following Clarke's scheme was based, among other things, on certain preliminary
evidence that computations would be considerably simplified if we averaged the logarithms of the ratios of the two
probability distributions in question with respect to (spherically-symmetric) priors and not (asymmetric) posteriors.
Since the supports of the priors and posteriors are essentially identical here, that is the Bloch sphere (except for
possibly the six isolated points, (±1, 0, 0), (0, ±1, 0) and (0, 0, ±1), having measure zero), our variation will not lead
to divergent integrals. However, since the support of a posterior, in general, may be a measurably smaller subset of
the support of the corresponding prior (due to the likelihood being null), the scheme of Clarke and not our variation
should be followed, as a rule. (For those priors studied here which are defined over the entire Bloch sphere, the
likelihood (7) is zero at the six mentioned points. However, all these priors, except for the uniform distribution
($p_{LD}$), are infinite at those points also.) Later developments appeared to, in fact, indicate that we would not have
paid a significant computational penalty in fully adhering to the approach of Clarke, ab initio.

Let us present two additional information-theoretic statistics consistent with the proposition that $p_{KM}$ is more
noninformative than $p_{SLD}$. The information gain of the KM-posterior (based on the same hypothetical six
observations) with respect to $p_{KM}$ itself is .151575, while the analogous SLD-result is less, approximately .140862. Also, a
single spin measurement yields an information gain of .157404 with respect to $p_{KM}$, but less ($5/6 - \log 2 \approx .140186$)
with respect to $p_{SLD}$.
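
The single-measurement figures can be examined numerically. The sketch below assumes, as our own reading, that the information gain is the relative entropy of the posterior with respect to the prior after one "up" outcome along z, for which the likelihood is (1 + z)/2 and the posterior is the prior multiplied by (1 + z). The angular average is done in closed form and the radial integral by a midpoint rule; the function names are hypothetical.

```python
import math

def angular_gain(r, one_minus_r):
    # Average over the sphere of radius r of (1 + z) log(1 + z), z = r cos(theta):
    # (1/(2r)) times the integral of w log w for w from 1 - r to 1 + r.
    a, b = 1.0 + r, one_minus_r
    return (0.5 * a * a * math.log(a) - 0.5 * b * b * math.log(b)) / (2.0 * r) - 0.5

def single_measurement_gains(n=100_000):
    # Radial weights in eps (r = cos(eps)): p_SLD -> (4/pi) r^2, p_KM -> (1/pi) r L.
    h = (math.pi / 2.0) / n
    gain_sld = gain_km = 0.0
    for k in range(n):
        eps = (k + 0.5) * h
        r = math.cos(eps)
        one_minus_r = 2.0 * math.sin(eps / 2.0) ** 2
        L = math.log((1.0 + r) / one_minus_r)
        g = angular_gain(r, one_minus_r)
        gain_sld += (4.0 / math.pi) * r * r * g
        gain_km += (1.0 / math.pi) * r * L * g
    return gain_sld * h, gain_km * h

gain_sld, gain_km = single_measurement_gains()
print(gain_sld, 5.0 / 6.0 - math.log(2.0))   # SLD value against the closed form
print(gain_km)                               # to be compared with .157404
```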
B. Two additional prior probability distributions



Using a recursively defined double sequence, Petz [10, eq. 21] has arrived at the operator monotone function,

$$f(t) = \frac{2 (t - 1)^2}{(1 + t)(\log t)^2}, \qquad (8)$$

the operator mean of which had been found by Morozova and Chentsov [18]. Normalizing, using numerical methods,
the volume element (5) of the associated monotone metric, we obtain the following prior probability distribution,

$$p_{MC}(r, \theta, \phi) = .00513299 \, (1 - r^2)^{-1/2} \{\log[(1 - r)/(1 + r)]\}^2 \sin\theta. \qquad (9)$$

Now, $D(p_{KM} \| p_{MC}) = .112421$ and $D(p_{MC} \| p_{KM}) = .117982$. These statistics are transformed to .106655 and
.482023, respectively, when they are computed with respect to the corresponding MC- and KM-posteriors, both
based on the previously hypothesized set of six outcomes. So, we can assert, by our rule, that $p_{MC}$ is itself more
noninformative than $p_{KM}$. (Strict adherence to the scheme of Clarke yields the corresponding statistics, .0910048
and .452794, leading to the very same conclusion. It appears, however, that our procedure yields somewhat larger
statistics than does Clarke's.) This is consistent with the earlier result, in the sense that $p_{MC}$ assigns greater
probability to some set of nearly pure systems ($r > .9846$) than does $p_{KM}$. Assuming that noninformativity is a
transitive relation, one would expect to find that $p_{MC}$ is also more noninformative than $p_{SLD}$. This proves to be the
case, as $D(p_{SLD} \| p_{MC}) = .388323$ and $D(p_{MC} \| p_{SLD}) = .445981$. The posterior versions of these statistics are
.186964 and .991175, respectively, with $p_{MC}$ assigning greater probability than $p_{SLD}$ to systems with $r > .973932$.
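
The radii at which one prior begins to exceed another can be located by bisection. The following is a minimal sketch under our own assumptions: the densities are those of (3), (4) and (9) with the common factor sin(theta) dropped, each pair is assumed to cross exactly once on the search interval, and the function names are hypothetical.

```python
import math

def L(r):
    return math.log((1.0 + r) / (1.0 - r))

# Radial parts of the normalized priors (3), (4) and (9); the common factor
# sin(theta) is dropped since it cancels in every comparison.
def p_sld(r):
    return r * r / (math.pi**2 * math.sqrt(1.0 - r * r))

def p_km(r):
    return r * L(r) / (4.0 * math.pi**2 * math.sqrt(1.0 - r * r))

def p_mc(r):
    return 0.00513299 * L(r) ** 2 / math.sqrt(1.0 - r * r)

def crossover(p, q, lo=0.5, hi=0.999999, iterations=60):
    # Bisection on p(r) - q(r); assumes p < q at lo and p > q at hi.
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if p(mid) > q(mid):
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

r_km_sld = crossover(p_km, p_sld)    # compare with .957504
r_mc_km = crossover(p_mc, p_km)      # compare with .9846
r_mc_sld = crossover(p_mc, p_sld)    # compare with .973932
print(r_km_sld, r_mc_km, r_mc_sld)
```
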

Larson and Dukes [4] have utilized a uniform prior,

$$p_{LD}(r, \theta, \phi) = 3 r^2 \sin\theta / (4\pi), \qquad (10)$$

over the Bloch sphere of two-level quantum systems. "The simplest prior which does not confine itself to pure states
assigns equal probability to equal infinitesimal volumes within the unit sphere of the geometrical parameterization.
(This is a physically reasonable prior, since the geometrical parameterization is metrically faithful.)" (Hall [1] obtains
the uniform distribution by two distinct arguments, one based on randomly correlated ensembles, and the other on
the Hilbert-Schmidt metric.)

Although $p_{LD}$ can be written as proportional to the volume element (5), using $f(t) = (1 + t)^2/\sqrt{t}$, this particular
function is not operator monotone, so $p_{LD}$ does not, in fact, correspond to a monotone metric. We can, nevertheless,
examine its informative properties. We have that $D(p_{LD} \| p_{MC}) = 1.07895$ and $D(p_{MC} \| p_{LD}) = 1.98719$. These
statistics are transformed to .559829 and 2.79851, respectively, if one replaces the second probability distribution
in each formula with its posterior counterpart based on the set of six outcomes previously hypothesized (7). These
results are, then, consistent with the earlier patterns, since $p_{MC}$ assigns greater probability than $p_{LD}$ to systems with
$r > .948724$. By assuming a doubling or repeating of the set of six measurements (involving the squaring of the
likelihood (7)) we are able to reduce the statistic .559829 still further, to .310686, while a tripling (corresponding
to eighteen measurements) yields less still, that is .307632. However, use of a posterior based on twenty-four such
measurements results in .529577. So, it might be asserted that $p_{LD}$ is considerably more informative than $p_{MC}$.

The variance of $z$, that is $\langle z^2 \rangle$, is .301762 for $p_{MC}$, .277778 for $p_{KM}$, .25 for $p_{SLD}$ and .2 for $p_{LD}$, thus, agreeing
in order with the relative noninformativities of these priors. In the relative ranking of $p_{SLD}$ and $p_{LD}$, we are in full
agreement with Hall [1]. Hall's argument that $p_{SLD}$ generates the maximally random ensemble is that "the Bures
metric for a two-dimensional system corresponds to the surface of a unit 4-ball, i. e., to the maximally symmetric
3-dimensional space of positive curvature . . . This space is homogenous and isotropic, and hence the Bures metric does
not distinguish a preferred location or direction in the space of density operators." Although we cannot disagree with
these statements, they do not appear to be determinative in judging the relative noninformativity of prior distributions
over the two-level quantum systems.
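
The four variances quoted above follow from the radial second moments, since for a spherically symmetric prior the variance of z is one third of the mean of r^2. The sketch below is only a numerical illustration under that assumption; each prior is passed as an unnormalized radial weight in the variable eps (with r = cos(eps)), and the helper name is hypothetical.

```python
import math

def second_moment_of_z(weight, n=100_000):
    # <z^2> = <r^2>/3 for a spherically symmetric prior. `weight(r, L)` is the
    # radial density times sqrt(1 - r^2), up to a constant, i.e. the density
    # in eps where r = cos(eps); L = log[(1 + r)/(1 - r)].
    h = (math.pi / 2.0) / n
    num = den = 0.0
    for i in range(n):
        eps = (i + 0.5) * h
        r = math.cos(eps)
        L = math.log((1.0 + r) / (2.0 * math.sin(eps / 2.0) ** 2))
        w = weight(r, L)
        num += w * r * r
        den += w
    return num / den / 3.0

variances = {
    "MC": second_moment_of_z(lambda r, L: L * L),
    "KM": second_moment_of_z(lambda r, L: r * L),
    "SLD": second_moment_of_z(lambda r, L: r * r),
    "LD": second_moment_of_z(lambda r, L: r * r * math.sqrt(1.0 - r * r)),
}
print(variances)   # to be compared with .301762, .277778, .25 and .2
```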



C. Two truncated prior probability distributions 

We now proceed to find two priors more noninformative than $p_{MC}$, but only by imposing a restriction on the a priori
possible two-level systems; that is, we must eliminate the possibility that the unknown system under examination
is either in a pure or "nearly" pure state. We consider the three operator monotone functions [?], [20],

$$f(t) = \frac{2 t^{(2 - n)/2} (t - 1)^n}{(1 + t)(\log t)^n}, \qquad n = 0, 1, 2. \qquad (11)$$
For $n = 0$ (corresponding to the maximal monotone metric [of the left logarithmic derivative] [?]) and $n = 1$, the
volume elements (5) are improper, that is not normalizable over the Bloch sphere, while for $n = 2$, the corresponding
volume element is proper or normalizable, corresponding to the operator monotone function (8) and probability
distribution $p_{MC}$, that is (9). To directly compare the three metrics based on (11), we choose to normalize their
volume elements over a three-dimensional ball of radius $R = 1 - 10^{-10}$. We, consequently, obtain the three probability
distributions ($p_n$) over the so-truncated convex set (not containing the pure [$r = 1$] and nearly pure [$1 > r > R$]
states),

$$p_0 = .00000112542 \, r^2 (1 - r^2)^{-3/2} \sin\theta, \qquad (12)$$

$$p_1 = .000569121 \, r (1 - r^2)^{-1} \log[(1 + r)/(1 - r)] \sin\theta, \qquad (13)$$

$$p_2 = .00513611 \, (1 - r^2)^{-1/2} \{\log[(1 - r)/(1 + r)]\}^2 \sin\theta. \qquad (14)$$
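
The three normalization constants in (12)-(14) can be reproduced approximately by integrating the radial factors up to R. In the sketch below the substitution r = tanh(s) is our own choice, made because it turns the steep growth near r = 1 into smooth integrands on a finite interval; `DELTA`, `S_MAX` and `truncated_constant` are hypothetical names.

```python
import math

DELTA = 1e-10                                   # R = 1 - DELTA
S_MAX = 0.5 * math.log((2.0 - DELTA) / DELTA)   # artanh(R)

def truncated_constant(n, steps=100_000):
    # Radial parts of (12)-(14) after the substitution r = tanh(s):
    #   r^(2-n) (1 - r^2)^(-(3-n)/2) L^n dr = (2s)^n tanh(s)^(2-n) cosh(s)^(1-n) ds,
    # with L = log[(1 + r)/(1 - r)] = 2s. The angles contribute a factor 4*pi.
    h = S_MAX / steps
    total = 0.0
    for k in range(steps):
        s = (k + 0.5) * h
        total += (2.0 * s) ** n * math.tanh(s) ** (2 - n) * math.cosh(s) ** (1 - n)
    return 1.0 / (4.0 * math.pi * total * h)

constants = [truncated_constant(n) for n in (0, 1, 2)]
print(constants)   # to be compared with .00000112542, .000569121 and .00513611
```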



We have that the relative entropy or information gain of $p_0$ with respect to $p_1$, that is [cf. (6)],

$$D(p_0 \| p_1) = \int_0^R \int_0^{\pi} \int_0^{2\pi} p_0 \log[p_0/p_1] \, d\phi \, d\theta \, dr \qquad (15)$$

equals .867442 nats. Also, $D(p_0 \| p_2) = 5.76086$, $D(p_1 \| p_0) = 1.654$, $D(p_1 \| p_2) = 2.37198$, $D(p_2 \| p_0) = 7.06816$, and
$D(p_2 \| p_1) = 1.52109$.

We assume now, again following our general methodology, that we have performed a two-level measurement
on each of six replicas of a two-level quantum system, two measurements in each of three orthogonal (X, Y, Z)
directions, and obtained a single "up" and a single "down" in each direction. Multiplying the likelihood (7) of
such an occurrence by $p_0$, $p_1$ and $p_2$, in turn, and normalizing the products over the truncated Bloch sphere (of
radius $R = 1 - 10^{-10}$), with normalization factors 335.987, 327.546 and 249.378, respectively, we obtain (in
accordance with Bayes' Theorem [13]) three posterior distributions, which we will designate as $P_0$, $P_1$ and $P_2$, in
the obvious fashion. Now, we have that $D(p_0 \| P_1) = 1.07576$, $D(p_0 \| P_2) = 6.24184$, $D(p_1 \| P_0) = 1.53564$,
$D(p_1 \| P_2) = 2.55172$, $D(p_2 \| P_0) = 6.94979$, and $D(p_2 \| P_1) = 1.42817$.

Our variation of the criterion elaborated upon by Clarke [2] (in which we have exchanged the positions of priors
and posteriors in the relative entropy statistics), then leads us to conclude that $p_0$ is more noninformative than both
$p_1$ and $p_2$ and that $p_1$ is more noninformative than $p_2$. According to our rule, $p_i$ is more noninformative than $p_j$
if both $D(p_i \| P_j) > D(p_i \| p_j)$ and $D(p_j \| P_i) < D(p_j \| p_i)$. For instance, for $i = 0$ and $j = 1$, we have that
$D(p_0 \| P_1) = 1.07576 > D(p_0 \| p_1) = .867442$ and $D(p_1 \| P_0) = 1.53564 < D(p_1 \| p_0) = 1.654$. Thus, by
adding information to $p_0$, in the form of the six hypothesized measurement results, we are able to more closely
approximate $p_1$, but apparently not vice versa. It would be of interest to study the changes in the statistics given
above as $R \to 1$. The ratio of $p_0$ to $p_1$ at $r = R = 1 - 10^{-10}$ is 5.89521, while that of $p_0$ to $p_2$ is 1947.41, and that of
$p_1$ to $p_2$ is 330.338. These results are in accordance with those above, in that more noninformative priors were also
found there to assign greater probability to more nearly pure states.
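
The pairwise rule can be applied mechanically to the twelve statistics reported in this subsection. The sketch below only re-expresses that rule; the dictionaries transcribe the reported values, and the names `prior`, `posterior` and `more_noninformative` are our own.

```python
# Relative entropies reported in the text; keys are the index pairs (i, j).
prior = {(0, 1): 0.867442, (0, 2): 5.76086, (1, 0): 1.654,
         (1, 2): 2.37198, (2, 0): 7.06816, (2, 1): 1.52109}      # D(p_i || p_j)
posterior = {(0, 1): 1.07576, (0, 2): 6.24184, (1, 0): 1.53564,
             (1, 2): 2.55172, (2, 0): 6.94979, (2, 1): 1.42817}  # D(p_i || P_j)

def more_noninformative(i, j):
    # p_i is judged more noninformative than p_j if updating p_j moves it away
    # from p_i, while updating p_i moves it toward p_j.
    return posterior[(i, j)] > prior[(i, j)] and posterior[(j, i)] < prior[(j, i)]

ranking = [(i, j) for i in range(3) for j in range(3)
           if i != j and more_noninformative(i, j)]
print(ranking)   # pairs (i, j) for which p_i is more noninformative than p_j
```

With the reported values, the only pairs that satisfy the rule are (0, 1), (0, 2) and (1, 2), which is the ordering stated in the text.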



III. DISCUSSION 

Petz and Toth [11, p. 215] found that the lower bound on the covariance matrix of unbiased estimators of parameters
provided by use of the symmetric logarithmic derivative or minimal monotone metric was "more informative" (that is,
tighter) than the bound furnished by the Kubo-Mori (Bogoliubov) monotone metric. This bound, in turn, was tighter
than that obtained from the maximal monotone metric. Their notion of informativity appears to be fully consistent
with that derived here through an apparently quite different line of argument. We assert this since we have found
that the noninformativity of $p_{SLD}$ is less than that of $p_{KM}$, and that of $p_{KM}$ is less than that of $p_{MC}$, while finally,
the noninformativity of the truncated form of $p_{MC}$, that is $p_2$, is less than that of $p_0$ (based on the maximal monotone
metric).

Derka, Buzek and Adam [7] (cf. [?]) approach the problem of using Bayesian reasoning to reconstruct a two-level
quantum system that is possibly impure, by assuming that the unknown system is coupled to a second system (which
they, for convenience, take to be two-level in nature), and that the pair of coupled systems is in a pure state. They
utilize the invariant integration measure on all possible such (four-dimensional) pure states. Hall [1] adopts a related
approach in the first part of his paper. In contrast, the analysis here and in [22,23] does not posit any such coupling
of the unknown system, regarding it, in effect, as independent or autonomous.

A most interesting question that remains to be formally addressed is whether or not it is possible to find two
different sets of measurements, one of which leads to a conclusion that a prior p is more noninformative than another
prior q, while the other set leads to an opposite deduction. Of course, if such hypothetical sets were to exist, which we
presume they do not, the validity of the general line of argument presented in this Letter would be called into question.
Another line of investigation that would be of interest to pursue would be the determination of the particular set of
measurements that minimizes the relative entropy between the corresponding posterior form (P) of p and q itself (cf.
[?]). Such a set of minimizing measurements could be said to best express the additional information contained in
q, above and beyond that in p. (Clarke [2] argues that one can, in general, find a data set to minimize the relative
entropy of the corresponding posterior with respect to the "informative" prior in question.)

In summary, based on our analysis here, we would recommend for the Bayesian inference of the parameters of an
unknown two-level quantum system the use of $p_{MC}$ as a prior or, preferably, some version (possibly modifying our
choice of $R$) of $p_0$, if one has a priori knowledge that the unknown system is described by a polarization vector of
length $\sqrt{x^2 + y^2 + z^2} \le R < 1$. If we view $p_0$ as a function of $R$, reexpress it in terms of Cartesian coordinates,
integrate over $z$, say, and take the limit $R \to 1$, we obtain the bivariate marginal probability distribution over the
unit disk,

$$(1 - x^2 - y^2)^{-1/2} / (2\pi). \qquad (16)$$

(The bivariate marginal distribution of $p_{SLD}$, on the other hand, is the uniform one, $1/\pi$, over the unit disk.
In [23] the distribution (16) was obtained from the Jeffreys prior for the family of trivariate normal distributions,
with null mean vectors, having the density matrices (2) as their covariance matrices.) Thus, if one is content to
estimate only two of the three parameters determining a two-level system, one can employ (16) as a prior and avoid
having to rule out the possibility of a pure or nearly pure state [23]. The distribution (16) is precisely the standard
(classical/nonquantum) noninformative (Jeffreys) prior for the two-parameter family of trinomial distributions with
probabilities $x^2$, $y^2$ and $1 - x^2 - y^2$ [13,23]. The conditional distribution over $[-1, 1]$ of $x$ in (16), given that $y = 0$,
is the "cosine distribution"

$$(1 - x^2)^{-1/2} / \pi. \qquad (17)$$

(In the statistical literature this is termed the arc-sine distribution.) It is the noninformative (Jeffreys) prior for the
one-parameter family of binomial distributions with probabilities $x^2$ and $1 - x^2$ [13], [23]. The distributions (16) and
(17) can also be obtained as conditional distributions of $p_{SLD}$, which itself is the Jeffreys prior for the family of
quadrinomial distributions with probabilities $x^2$, $y^2$, $z^2$ and $1 - x^2 - y^2 - z^2$. (This corresponds to the geometry of a
three-dimensional hemisphere, as pointed out in [?] (cf. [?]).)
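
As a brief worked check of two of these statements (the abbreviation $a$ is ours), the uniform marginal of $p_{SLD}$ follows by integrating (1) over $z$, and (17) follows by restricting (16) to $y = 0$ and renormalizing over $[-1, 1]$:

```latex
\int_{-a}^{a} \frac{(a^2 - z^2)^{-1/2}}{\pi^2}\, dz
  = \frac{1}{\pi^2} \Bigl[ \arcsin(z/a) \Bigr]_{-a}^{a}
  = \frac{1}{\pi},
  \qquad a^2 = 1 - x^2 - y^2 ,
\\[6pt]
\frac{(1 - x^2)^{-1/2}/(2\pi)}{\int_{-1}^{1} (1 - u^2)^{-1/2}\, du \,/\, (2\pi)}
  = \frac{(1 - x^2)^{-1/2}}{\pi} .
```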

For additional quantum applications of other (classical) work of Clarke (jointly with Barron) [29], [30], besides [2],
having relevance to comparative properties of the volume elements of the monotone metrics for the two-level
quantum systems (cf. [?]), we refer the reader to [20]. For a further application (this one to spin-1 systems) in
which the comparative properties of priors based on the minimal and maximal monotone metrics are assessed (with
the maximal one again displaying a certain superiority, namely greater computational tractability), see [31].